Title: 

1 TCGCGGCGGC CGTGATGGCT GGTGACGGCG GGGCCGGGCA GGGGACCGGG 
51 GCCGCGGCCC GGGAGCGGGC CAGCTGCCGG GAGCCCTGAA TCACCGCCTG 
101 GCCCGACTCC ACCATGAACG TCGCGCTGCA GGAGCTGGGA GCTGGCAGCA 
151 ACATGGTGGA GTACAAACGG GCCACGCTTC GGGATGAAGA CGCACCCGAG 
201 ACCCCCGTAG AGGGCGGGGC CTCCCCGGAC GCCATGGAGG TGGGCAAGGG 
251 GGCTTCCCCT TTCTCACCAG GCCCCAGCCC TGGCATGACG CCTGGCACAC 
301 CCAGGAGCTC TGGGCTGTTC TGGAGGGTCA CCTGCCCCCA CCTCCGCTCC 
351 ATCTCTGGCC TCTGCTCTAG GACTATGGTG GGATTCCAGA AGGGGACAAG 
401 ACAGCTGTTA GGCTCACGCA CGCAGCTGGA GCTGGTCTTA GCAGGTGCCT 
451 CTCTACTGCT GGCTGCACTG CTTCTGGGCT GCCTTGTGGC CCTAGGGGTC 
501 CAGTACCACA GAGACCCATC CCACAGCACC TGCCTTACAG AGGCCTGCAT 
551 TCGAGTGGCT GGAAAAATCC TGGAGTCCCT GGACCGAGGG GTGAGCCCCT 
601 GTGAGGACTT TTACCAGTTC TCCTGTGGGG GCTGGATTCG GAGGAACCCC 
651 CTGCCCGATG GGCGTTCTCG CTGGAACACC TTCAACAGCC TCTGGGACCA 
701 AAACCAGGCC ATACTGAAGC ACCTGCTTGA AAACACCACC TTCAACTCCA 
751 GCAGTGAAGC TGAGCAGAAG ACACAGCGCT TCTACCTATC TTGCCTACAG 
801 GTGGAGCGCA TTGAGGAGCT GGGAGCCCAG CCACTGAGAG ACCTCATTGA 
851 GAAGATTGGT GGTTGGAACA TTACGGGGCC CTGGGACCAG GACAACTTTA 
901 TGGAGGTGTT GAAGGCAGTA GCAGGGACCT ACAGGGCCAC CCCATTCTTC 
951 ACCGTCTACA TCAGTGCCGA CTCTAAGAGT TCCAACAGCA ATGTTATCCA 
1001 GGTGGACCAG TCTGGGCTCT TTCTGCCCTC TCGGGATTAC TACTTAAACA 
1051 GAACTGCCAA TGAGAAAGTG CTCACTGCCT ATCTGGATTA CATGGAGGAA 
1101 CTGGGGATGC TGCTGGGTGG GCGGCCCACC TCCACGAGGG AGCAGATGCA 
1151 GCAGGTGCTG GAGTTGGAGA TACAGCTGGC CAACATCACA GTGCCCCAGG 
1201 ACCAGCGGCG CGACGAGGAG AAGATCTACC ACAAGATGAG CATTTCGGAG 
1251 CTGCAGGCTC TGGCGCCCTC CATGGACTGG CTTGAGTTCC TGTCTTTCTT 
1301 GCTGTCACCA TTGGAGTTGA GTGACTCTGA GCCTGTGGTG GTGTATGGGA 
1351 TGGATTATTT GCAGCAGGTG TCAGAGCTCA TCAACCGCAC GGAACCAAGC 
1401 ATCCTGAACA ATTACCTGAT CTGGAACCTG GTGCAAAAGA CAACCTCAAG 
1451 CCTGGACCGA CGCTTTGAGT CTGCACAAGA GAAGCTGCTG GAGACCCTCT 
1501 ATGGCACTAA GAAGTCCTGT GTGCCGAGGT GGCAGACCTG CATCTCCAAC 
1551 ACGGATGACG CCCTTGGCTT TGCTTTGGGG TCCCTCTTCG TGAAGGCCAC 
1601 GTTTGACCGG CAAAGCAAAG AAATTGCAGA GGGGATGATC AGCGAAATCC 
1651 GGACCGCATT TGAGGAGGCC CTGGGACAGC TGGTTTGGAT GGATGAGAAG 
1701 ACCCGCCAGG CAGCCAAGGA GAAAGCAGAT GCCATCTATG ATATGATTGG 
1751 TTTCCCAGAC TTTATCCTGG AGCCCAAAGA GCTGGATGAT GTTTATGACG 
1801 GGTACGAAAT TTCTGAAGAT TCTTTCTTCC AAAACATGTT GAATTTGTAC 
1851 AACTTCTCTG CCAAGGTTAT GGCTGACCAG CTCCGCAAGC CTCCCAGCCG 
1901 AGACCAGTGG AGCATGACCC CCCAGACAGT GAATGCCTAC TACCTTCCAA 
1951 CTAAGAATGA GATCGTCTTC CCCGCTGGCA TCCTGCAGGC CCCCTTCTAT 
2001 GCCCGCAACC ACCCCAAGGC CCTGAACTTC GGTGGCATCG GTGTGGTCAT 
2051 GGGCCATGAG TTGACGCATG CCTTTGATGA CCAAGGGCGC GAGTATGACA 
2101 AAGAAGGGAA CCTGCGGCCC TGGTGGCAGA ATGAGTCCCT GGCAGCCTTC 
2151 CGGAACCACA CGGCCTGCAT GGAGGAACAG TACAATCAAT ACCAGGTCAA 
2201 TGGGGAGAGG CTCAACGGCC GCCAGACGCT GGGGGAGAAC ATTGCTGACA 
2251 ACGGGGGGCT GAAGGCTGCC TACAATGCTT ACAAAGCATG GCTGAGAAAG 
2301 CATGGGGAGG AGCAGCAACT GCCAGCCGTG GGGCTCACCA ACCACCAGCT 
2351 CTTCTTCGTG GGATTTGCCC AGGTGTGGTG CTCGGTCCGC ACACCAGAGA 
2401 GCTCTCACGA GGGGCTGGTG ACCGACCCCC ACAGCCCTGC CCGCTTCCGC 
2451 GTGCTGGGCA CTCTCTCCAA CTCCCGTGAC TTCCTGCGGC ACTTCGGCTG 
2501 CCCTGTCGGC TCCCCCATGA ACCCAGGGCA GCTGTGTGAG GTGTGGTAGA 
2551 CCTGGATCAG GGGAGAAATG CCCAGCTGTC ACCAGACCTG GGGCAGCTCT 
2601 CCTGACAAAG CTGTTTGCTC TTGGGTTGGG AGGAAGCAAA TGCAAGCTGG 
2651 GCTGGGTCTA GTCCCTCCCC CCCACAGGTG ACATGAGTAC AGACCCTCCT 
2701 CAATCACCAC ATTGTGCCTC TGCTTTGGGG GTGCCCCTGC CTCCAGCAGA 
2751 GCCCCCACCA TTCACTGTGA CATCTTTCCG TGTCACCCTG CCTGGAAGAG 
2801 GTCTGGGTGG GGAGGCCAGT TCCCATAGGA AGGAGTCTGC CTCTTCTGTC 
2851 CCCAGGCTCA CTCAGCCTGG CGGCCATGGG GCCTGCCGTG CCTGCCCCAC 
2901 TGTGACCCAC AGGCCTGGGT GGTGTACCTC CTGGACTTCT CCCCAGGCTC 
2951 ACTCAGTGCG CACTTAGGGG TGGACTCAGC TCTGTCTGGC TCACCCTCAC 
3001 GGGCTACCCC CACCTCACCC TGTGCTCCTT GTGCCACTGC TCCCAGTGCT 
3051 GCTGCTGACC TTCACTGACA GCTCCTAGTG GAAGCCCAAG GGCCTCTGAA 
3101 AGCCTCCTGC TGCCCACTGT TTCCCTGGGC TGAGAGGGGA AGTGCATATG 
3151 TGTAGCGGGT ACTGGTTCCT GTGTCTTAGG GCACAAGCCT TAGCAAATGA 
3201 TTGATTCTCC CTGGACAAAG CAGGAAAGCA GATAGAGCAG GGAAAAGGAA 
3251 GAACAGAGTT TATTTTTACA GAAAAGAGGG TGGGAGGGTG TGGTCTTGGC 
3301 CCTTATAGGA CCCTGTGCCA ATAAACAGAC ATGCATCCGT CAAAAAAAAA 
3351 AAAAAAAAAA AAAAAAAAAA AAAAAAA (SEQ ID NO:1) 

FEATURES : 

5'UTR: 1-113 

Start Codon: 114 

Stop Codon: 2547 

3'UTR: 2550 
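
The coordinates above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and stop positions are 1-based and mark the first base of each codon; the function and constant names are illustrative and not part of the original listing.

```python
# Coordinates copied from the FEATURES block (1-based cDNA numbering).
START_CODON = 114   # first base of the ATG
STOP_CODON = 2547   # first base of the stop codon


def coding_codons(start: int, stop: int) -> int:
    """Count sense codons from the start codon (inclusive) to the stop codon (exclusive)."""
    span = stop - start
    if span % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return span // 3


n_codons = coding_codons(START_CODON, STOP_CODON)
five_prime_utr_length = START_CODON - 1   # bases 1..113
three_prime_utr_start = STOP_CODON + 3    # first base after the stop codon

print(n_codons, five_prime_utr_length, three_prime_utr_start)
```

Under these assumptions the open reading frame spans 811 codons, which agrees with the 811-residue protein listed as SEQ ID NO: 2, and the first 3'UTR base falls at 2550.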



Homologous proteins: 

Top 10 BLAST Hits 

Hit                                                                  Score   E 
CRA|18000005141003 /altid=gi|7662200 /def=ref|NP_055508.1| KIAA...   1550    0.0 
CRA|18000005084162 /altid=gi|2136744 /def=pir||146078 endotheli...   1430    0.0 
CRA|18000005012182 /altid=gi|1706565 /def=sp|Q10711|ECE2_BOVIN ...   1430    0.0 
CRA|150000075554683 /altid=gi|9789315 /def=gb|AAF98287.1|AF2302...   1027    0.0 
CRA|1000682324124 /altid=gi|5821116 /def=dbj|BAA83687.1| (AB031...   1001    0.0 
CRA|108000024636251 /altid=gi|12721007 /def=ref|XP_001827.2| en...   1001    0.0 
CRA|18000004932659 /altid=gi|627989 /def=pir||A53679 endothelin...   1000    0.0 
CRA|18000005060029 /altid=gi|1706564 /def=sp|P42893|ECE1_RAT EN...   996     0.0 
CRA|18000005155376 /altid=gi|3287157 /def=emb|CAA19767.1| (AL03...   995     0.0 
CRA|18000004985166 /altid=gi|1082351 /def=pir||JC2521 endotheli...   995     0.0 

BLAST dbEST hits: 

Hit                                              Score   E 
gi|6837875 /dataset=dbest /taxon=9606 /org=...   1094    0.0 
gi|5925169 /dataset=dbest /taxon=9606 ...        983     0.0 
gi|10725997 /dataset=dbest /taxon=96...          519     e-144 
gi|2162041 /dataset=dbest /taxon=9606 ...        400     e-109 




EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

From BLAST dbEST hits: 

gi | 6837875 lung 

gi | 5925169 amygdala 

gi | 10725997 adrenal gland 

gi | 2162041 total fetus 

From tissue screening panels: 
Hippocampus 
1 MNVALQELGA GSNMVEYKRA TLRDEDAPET PVEGGASPDA MEVGKGASPF 
51 SPGPSPGMTP GTPRSSGLFW RVTCPHLRSI SGLCSRTMVG FQKGTRQLLG 
101 SRTQLELVLA GASLLLAALL LGCLVALGVQ YHRDPSHSTC LTEACIRVAG 
151 KILESLDRGV SPCEDFYQFS CGGWIRRNPL PDGRSRWNTF NSLWDQNQAI 
201 LKHLLENTTF NSSSEAEQKT QRFYLSCLQV ERIEELGAQP LRDLIEKIGG 
251 WNITGPWDQD NFMEVLKAVA GTYRATPFFT VYISADSKSS NSNVIQVDQS 
301 GLFLPSRDYY LNRTANEKVL TAYLDYMEEL GMLLGGRPTS TREQMQQVLE 
351 LEIQLANITV PQDQRRDEEK IYHKMSISEL QALAPSMDWL EFLSFLLSPL 
401 ELSDSEPVVV YGMDYLQQVS ELINRTEPSI LNNYLIWNLV QKTTSSLDRR 
451 FESAQEKLLE TLYGTKKSCV PRWQTCISNT DDALGFALGS LFVKATFDRQ 
501 SKEIAEGMIS EIRTAFEEAL GQLVWMDEKT RQAAKEKADA IYDMIGFPDF 
551 ILEPKELDDV YDGYEISEDS FFQNMLNLYN FSAKVMADQL RKPPSRDQWS 
601 MTPQTVNAYY LPTKNEIVFP AGILQAPFYA RNHPKALNFG GIGVVMGHEL 
651 THAFDDQGRE YDKEGNLRPW WQNESLAAFR NHTACMEEQY NQYQVNGERL 
701 NGRQTLGENI ADNGGLKAAY NAYKAWLRKH GEEQQLPAVG LTNHQLFFVG 
751 FAQVWCSVRT PESSHEGLVT DPHSPARFRV LGTLSNSRDF LRHFGCPVGS 
801 PMNPGQLCEV W (SEQ ID NO: 2) 
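
The relationship between the cDNA listing (SEQ ID NO:1) and this protein sequence can be checked by translating from the annotated start codon. The sketch below is illustrative only: it uses the standard genetic code, takes cDNA positions 101-200 as printed in the listing, and assumes 1-based numbering; `translate_from` and the constants are hypothetical names introduced here.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# cDNA positions 101-200, copied from the sequence listing.
FRAGMENT = (
    "GCCCGACTCC" "ACCATGAACG" "TCGCGCTGCA" "GGAGCTGGGA" "GCTGGCAGCA"
    "ACATGGTGGA" "GTACAAACGG" "GCCACGCTTC" "GGGATGAAGA" "CGCACCCGAG"
)
FRAGMENT_START = 101  # 1-based position of the first base in FRAGMENT


def translate_from(seq: str, seq_start: int, pos: int) -> str:
    """Translate complete codons of seq, beginning at 1-based position pos."""
    offset = pos - seq_start
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


peptide = translate_from(FRAGMENT, FRAGMENT_START, 114)
print(peptide)
```

Translating from position 114 yields MNVALQELGAGSNMVEYKRATLRDEDAPE, the first 29 residues of the protein listed above.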



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

Number of matches: 9 

1 207-210 NTTF 

2 211-214 NSSS 

3 252-255 NITG 

4 312-315 NRTA 

5 357-360 NITV 

6 424-427 NRTE 

7 580-583 NFSA 

8 673-676 NESL 

9 681-684 NHTA 
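
Matches of this kind can be reproduced with a short pattern scan. The sketch below assumes the PROSITE PS00001 consensus N-{P}-[ST]-{P} and scans only residues 201-220 of SEQ ID NO: 2; the function name and the fragment choice are illustrative, and this is not necessarily the tool that produced the table.

```python
import re

# PROSITE PS00001, N-{P}-[ST]-{P}, written as a regular expression.
# The lookahead lets overlapping sites be reported.
NGLYC = re.compile(r"(?=(N[^P][ST][^P]))")

FRAGMENT = "LKHLLENTTFNSSSEAEQKT"  # residues 201-220 of SEQ ID NO: 2
FRAGMENT_START = 201               # 1-based position of the first residue


def find_sites(seq: str, seq_start: int):
    """Return (begin, end, motif) tuples using 1-based inclusive coordinates."""
    sites = []
    for match in NGLYC.finditer(seq):
        begin = match.start() + seq_start
        sites.append((begin, begin + 3, match.group(1)))
    return sites


sites = find_sites(FRAGMENT, FRAGMENT_START)
print(sites)
```

On this fragment the scan reports NTTF at 207-210 and NSSS at 211-214, matching the first two rows of the table.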



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

18-21 KRAT 

[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 8 

1 21-23 TLR 

2 62-64 TPR 

3 220-222 TQR 

4 272-274 TYR 

5 340-342 STR 

6 465-467 TKK 

7 582-584 SAK 

8 757-759 SVR 



[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 19 

1 21-24 TLRD 

2 30-33 TPVE 

3 103-106 TQLE 

4 161-164 SPCE 
5 192-195 SLWD 

6 212-215 SSSE 

7 214-217 SEAE 

8 314-317 TANE 

9 340-343 STRE 

10 376-379 SISE 

11 398-401 SPLE 

12 403-406 SDSE 

13 445-448 SSLD 

14 453-456 SAQE 

15 478-481 SNTD 

16 514-517 TAFE 

17 613-616 TKNE 

18 705-708 TLGE 

19 763-766 SSHE 
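
The phosphorylation-site tables follow the same logic as any PROSITE pattern scan, including overlapping hits such as 212-215 and 214-217. Below is a minimal sketch of a pattern-to-regex converter, assuming the PROSITE consensus patterns [ST]-x(2)-[DE] (PS00006) and [ST]-x-[RK] (PS00005). It handles only the syntax elements used here (residues, `x`, `[..]`, `{..}` and repeat counts); `prosite_to_regex` and `scan` are hypothetical helper names.

```python
import re


def prosite_to_regex(pattern: str):
    """Convert a simple PROSITE pattern into a compiled lookahead regex."""
    pieces = []
    for element in pattern.split("-"):
        core, _, repeat = element.partition("(")
        if core == "x":
            piece = "."                        # any residue
        elif core.startswith("{"):
            piece = "[^" + core[1:-1] + "]"    # excluded residues
        else:
            piece = core                       # single residue or [..] class
        if repeat:
            piece += "{" + repeat.rstrip(")") + "}"
        pieces.append(piece)
    # A lookahead keeps overlapping matches.
    return re.compile("(?=(" + "".join(pieces) + "))")


def scan(pattern: str, seq: str, seq_start: int):
    """Return (begin, end, motif) tuples using 1-based inclusive coordinates."""
    hits = []
    for match in prosite_to_regex(pattern).finditer(seq):
        motif = match.group(1)
        begin = match.start() + seq_start
        hits.append((begin, begin + len(motif) - 1, motif))
    return hits


FRAGMENT = "NSSSEAEQKTQRFYL"  # residues 211-225 of SEQ ID NO: 2
ck2_hits = scan("[ST]-x(2)-[DE]", FRAGMENT, 211)
pkc_hits = scan("[ST]-x-[RK]", FRAGMENT, 211)
print(ck2_hits, pkc_hits)
```

On this fragment the casein kinase II pattern returns SSSE (212-215) and SEAE (214-217), and the protein kinase C pattern returns TQR (220-222), in agreement with the tables.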



[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

Number of matches: 4 

1 365-372 RRDEEKIY 

2 457-463 KLLETLY 

3 535-542 KEKADAIY 

4 555-561 KELDDVY 

[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 14 

1 9-14 GAGSNM 

2 57-62 GMTPGT 

3 61-66 GTPRSS 

4 122-127 GCLVAL 

5 159-164 GVSPCE 

6 271-276 GTYRAT 

7 331-336 GMLLGG 

8 335-340 GGRPTS 

9 464-469 GTKKSC 

10 643-648 GVVMGH 

11 714-719 GGLKAA 

12 715-720 GLKAAY 

13 782-787 GTLSNS 

14 795-800 GCPVGS 



[7] PDOC00047 PS00048 PROTAMINE_P1 
Protamine P1 signature 

776-787 ARFRVLGTLSNS 

[8] PDOC00129 PS00142 ZINC_PROTEASE 

Neutral zinc metallopeptidases, zinc-binding region signature 
645-654 VMGHELTHAF 



Membrane spanning structure and domains: 

Helix Begin End Score Certainty 

1 43 63 0.638 Putative 

2 109 129 2.142 Certain 

3 380 400 0.619 Putative 

4 736 756 0.890 Putative 
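
Each helix in this table is given as a 1-based, inclusive Begin/End pair, so a predicted segment can be pulled from the protein sequence by simple slicing. The sketch below is illustrative only, using residues 101-130 of SEQ ID NO: 2 and the coordinates of helix 2; `segment` is a hypothetical helper name.

```python
FRAGMENT = "SRTQLELVLAGASLLLAALLLGCLVALGVQ"  # residues 101-130 of SEQ ID NO: 2
FRAGMENT_START = 101                          # 1-based position of the first residue


def segment(seq: str, seq_start: int, begin: int, end: int) -> str:
    """Extract residues begin..end (1-based, inclusive) from a numbered fragment."""
    return seq[begin - seq_start:end - seq_start + 1]


helix_2 = segment(FRAGMENT, FRAGMENT_START, 109, 129)
print(helix_2, len(helix_2))
```

With these coordinates the segment is 21 residues long, the same window length as every helix in the table (End minus Begin plus one).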
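BLAST Alignment to Top Hit: 

>CRA|18000005141003 /altid=gi|7662200 /def=ref|NP_055508.1| KIAA0604 
gene product [Homo sapiens] /org=Homo sapiens 
/taxon=9606 /dataset=nraa /length=765 
Length = 765 

Score = 1550 bits (3969), Expect = 0.0 
Identities = 765/811 (94%), Positives = 765/811 (94%), Gaps = 46/811 (5%) 

Query: 1   MNVALQELGAGSNMVEYKRATLRDEDAPETPVEGGASPDAMEVGKGASPFSPGPSPGMTP 60
           MNVALQELGAGSNMVEYKRATLRDEDAPETPVEGGASPDAMEV
Sbjct: 1   MNVALQELGAGSNMVEYKRATLRDEDAPETPVEGGASPDAMEV----------------- 43

Query: 61  GTPRSSGLFWRVTCPHLRSISGLCSRTMVGFQKGTRQLLGSRTQLELVLAGASLLLAALL 120
                                        GFQKGTRQLLGSRTQLELVLAGASLLLAALL
Sbjct: 44  -----------------------------GFQKGTRQLLGSRTQLELVLAGASLLLAALL 74

Query: 121 LGCLVALGVQYHRDPSHSTCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPL 180
           LGCLVALGVQYHRDPSHSTCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPL
Sbjct: 75  LGCLVALGVQYHRDPSHSTCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPL 134

Query: 181 PDGRSRWNTFNSLWDQNQAILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQP 240
           PDGRSRWNTFNSLWDQNQAILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQP
Sbjct: 135 PDGRSRWNTFNSLWDQNQAILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQP 194

Query: 241 LRDLIEKIGGWNITGPWDQDNFMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQS 300
           LRDLIEKIGGWNITGPWDQDNFMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQS
Sbjct: 195 LRDLIEKIGGWNITGPWDQDNFMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQS 254

Query: 301 GLFLPSRDYYLNRTANEKVLTAYLDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITV 360
           GLFLPSRDYYLNRTANEKVLTAYLDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITV
Sbjct: 255 GLFLPSRDYYLNRTANEKVLTAYLDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITV 314

Query: 361 PQDQRRDEEKIYHKMSISELQALAPSMDWLEFLSFLLSPLELSDSEPVVVYGMDYLQQVS 420
           PQDQRRDEEKIYHKMSISELQALAPSMDWLEFLSFLLSPLELSDSEPVVVYGMDYLQQVS
Sbjct: 315 PQDQRRDEEKIYHKMSISELQALAPSMDWLEFLSFLLSPLELSDSEPVVVYGMDYLQQVS 374

Query: 421 ELINRTEPSILNNYLIWNLVQKTTSSLDRRFESAQEKLLETLYGTKKSCVPRWQTCISNT 480
           ELINRTEPSILNNYLIWNLVQKTTSSLDRRFESAQEKLLETLYGTKKSCVPRWQTCISNT
Sbjct: 375 ELINRTEPSILNNYLIWNLVQKTTSSLDRRFESAQEKLLETLYGTKKSCVPRWQTCISNT 434

Query: 481 DDALGFALGSLFVKATFDRQSKEIAEGMISEIRTAFEEALGQLVWMDEKTRQAAKEKADA 540
           DDALGFALGSLFVKATFDRQSKEIAEGMISEIRTAFEEALGQLVWMDEKTRQAAKEKADA
Sbjct: 435 DDALGFALGSLFVKATFDRQSKEIAEGMISEIRTAFEEALGQLVWMDEKTRQAAKEKADA 494

Query: 541 IYDMIGFPDFILEPKELDDVYDGYEISEDSFFQNMLNLYNFSAKVMADQLRKPPSRDQWS 600
           IYDMIGFPDFILEPKELDDVYDGYEISEDSFFQNMLNLYNFSAKVMADQLRKPPSRDQWS
Sbjct: 495 IYDMIGFPDFILEPKELDDVYDGYEISEDSFFQNMLNLYNFSAKVMADQLRKPPSRDQWS 554

Query: 601 MTPQTVNAYYLPTKNEIVFPAGILQAPFYARNHPKALNFGGIGVVMGHELTHAFDDQGRE 660
           MTPQTVNAYYLPTKNEIVFPAGILQAPFYARNHPKALNFGGIGVVMGHELTHAFDDQGRE
Sbjct: 555 MTPQTVNAYYLPTKNEIVFPAGILQAPFYARNHPKALNFGGIGVVMGHELTHAFDDQGRE 614

Query: 661 YDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQVNGERLNGRQTLGENIADNGGLKAAY 720
           YDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQVNGERLNGRQTLGENIADNGGLKAAY
Sbjct: 615 YDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQVNGERLNGRQTLGENIADNGGLKAAY 674

Query: 721 NAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCSVRTPESSHEGLVTDPHSPARFRV 780
           NAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCSVRTPESSHEGLVTDPHSPARFRV
Sbjct: 675 NAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCSVRTPESSHEGLVTDPHSPARFRV 734

Query: 781 LGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 811
           LGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW
Sbjct: 735 LGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 765 (SEQ ID NO:4) 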
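
The summary statistics in the alignment header are internally consistent, which a few lines of arithmetic make explicit. This is a sketch using only the figures printed in the header; the constant names are illustrative.

```python
# Figures copied from the alignment header.
ALIGNED_COLUMNS = 811   # denominator of the Identities and Gaps fields
IDENTITIES = 765
GAPS = 46
SUBJECT_LENGTH = 765    # "Length = 765"


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100.0 * part / whole


identity_pct = percent(IDENTITIES, ALIGNED_COLUMNS)
gap_pct = percent(GAPS, ALIGNED_COLUMNS)
mismatches = ALIGNED_COLUMNS - GAPS - IDENTITIES

print(round(identity_pct, 2), round(gap_pct, 2), mismatches)
```

Truncating the two percentages to whole numbers gives the printed 94% and 5%. Identities plus gaps account for all 811 aligned columns, so the alignment contains no mismatched residues: the 765-residue subject is identical to the query wherever the two overlap, and the query carries 46 additional residues (positions 44-89).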



Hmmer search results (Pfam) 

Model     Description                     Score   E-value    N 
CE00310   E00310 neutral_endopeptidase    456.3   2.6e-133   1 
PF01431   Peptidase family M13            270.4   2.4e-77    1 
CE00339   E00339 vitamin_D_receptor       4.3     1.1        1 

Parsed for domains: 

Model     Domain   seq-f   seq-t   hmm-f   hmm-t     score   E-value 
CE00339   1/1      367     398     412     443       4.3     1.1 
PF01431   1/1      607     810     1       225 []    270.4   2.4e-77 
CE00310   1/1      140     811     66      798 .]    456.3   2.6e-133 
1 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
1001 NNNNNNNNNN NNNNNNNNNN NNNNNNCACC TTAGACTTGA CAGGCCTGCT 
1051 TAGTCGGACT CTAAAGCACC CCTTTGCTTT TCGTTAAATA TTGCTTGGTG 
1101 TTAGTTTTTT TTCTCCTTGT AAATCTCCCA AATAAAACGG TTTGCTTTCC 
1151 CCAAGTTAGA AGTGTTAGCA CGTCTTTTCT TTAAATATCT GTGCATGGCT 
1201 GTTTTTTTCC CTGCCAATTT GTCACCATCT GTAACCCTCC CTTTATGAGA 
1251 CGATCTGATG ACAGCAGTTA TCTTGGAGAG TAGAAGTGTG GTCTTGAAGC 
1301 GCCATGGAAG AGTAGAGTCA GTGTATGCTG TGTGTGTGTG GAGTGTATGC 
1351 TCCCCCTGCA CTTGGTGTGT GTACATACAG AAACACAGTG TGCGTGTGTG 
1401 TTGGCTCTGG GTGTGTTGTG CGTGTGTACA CTGTGTGTGA GTATGCAGTG 
1451 TGTGTACATT CTGTGGGCAT CTCGTGTGTG TGTGGACTGT GTGCTGGGCG 
1501 TCGTGCCTGC CCGTGTCCTT GGCGCCTTGG CGTCTATGCG TTCTCTGCAC 
1551 ATAGGTAGGT ACCACGTGCA CACCCTGAAT GTGAGTGAAC TGCCTGTGTG 
1601 CTATGTATTT GCCGGCTGAA GAGGGGCTGT GTGGACTACT GGGGGAAGAC 
1651 GTTCCTCANG AGGGCATAAT TTCTCTAAAG TGCTTAAAGG GGATGGAGAG 
1701 AGCCTGAAAT TTGGGGGAAG TAGGCCAAGG AGTATTATCA ACGTCTGGGC 
1751 CTGGTTGAAT TTCATTACTT TTCCTAGGAA AGTAAATTAT GGGTGGCTTG 
1801 AAGGAGGGTG CTGCTGAGAT GGGGGGCGGA CCATGAAGCG TGGAGGGGTC 
1851 TCCGGTGTTG CTGGAGGGCA GCTGGAGCCT GCGGAGAGCC TCGGCGCGCT 
1901 CCTCCCTCTC CCCCACCCTC CCCCCACCCC GGGCGGGGCT CCGCGTGGGG 
1951 CGGTGGACTC GGGCGGGGGG GGGGGCGGCC GCGGCCGAGC GGGGGTGCTG 
2001 CGCGGCGGCC GTGATGGCTG GTGACGGCGG GGCCGGGCAG GGGACCGGGG 
2051 CCGCGGCCCG GGAGCGGGCC AGCTGCCGGG AGCCCTGAAT CACCGCCTGG 
2101 CCCGACTCCA CCATGAACGT CGCGCTGCAG GAGCTGGGAG CTGGCAGCAA 
2151 CGTGAGTGGG GGCCCCGGGC TCCACGGGAG GGGACTGGGT GGAGGGGGAC 
2201 GAGGCAGAGG GGTCGGCCGC GGAGGGGCAG GCGGTGCCCG GCTCGCGGAG 
2251 GTAAGGCTGC CTCCCGGGCC TGGTGGAGGG GTGATAGAGA GACCCCGGGC 
2301 CCGAGAGCAG GGCAGGTGGG AAGGGAAGGG CCCTCTTAGC AGGGCGGAGG 
2351 GGTCCGCGAG GCAGGGAGCA CTGGGGCAGG GTCGTGGGCA AATAGCCCTC 
2401 TCTGCCTGAC CTCGGTTGGC AACCCCGACT GTCTGGCAGA TGGTGGAGTA 
2451 CAAACGGGCC ACGCTTCGGG ATGAAGACGC ACCCGAGACC CCCGTAGAGG 
2501 GCGGGGCCTC CCCGGACGCC ATGGAGGTGG GCAAGGGGGC TTCCCCTTTC 
2551 TCACCAGGCC CCAGCCCTGG CATGACGCCT GGCACACCCA GGAGCTCTGG 
2601 GCTGTTCTGG AGGGTCATCT GCCCCCACCT CCGCTCCATC TCTGGCCTCT 
2651 GCTCTAGGAC TATGGTGAGG CGATGCTAAG CCGTGACGTT GCACAAAACA 
2701 GACTCAAGGC TCAACTCACT GGCTGGCCTC ATTGCCCCCG GGCCCAGAGT 
2751 TAACCCTGTG GCTCTGAAAA CTGCCTGTGG CTTCACCCTC TGGTAATCTT 
2801 GGATCCCTGC CCTGCATCTC AGTCACTCTC TGTCCCCCTG TGTTCCCCAG 
2851 GTGGGATTCC AGAAGGGGAC AAGACAGCTG TTAGGCTCAC GCACGCAGCT 
2901 GGAGCTGGTC TTAGCAGGTG CCTCTCTACT GCTGGCTGCA CTGCTTCTGG 
2951 GCTGCCTTGT GGCCCTAGGG GTCCAGTACC ACAGAGGTAG GTGGGCCCAC 
3001 ACTCTTCGTC AGTATTCATA ACTAGGGGTT CTGGAGGCCT AAGGGCCTCT 
3051 AAGATTTTCA CTTGTGGGAA CCAAGCCTTC CCTGCAGAAA AGCCCCCGGC 
3101 TTTGCTTTCT CTTCCCAACC TTCCTGCTGT CATGGCCCTT GCAGAGTTTG 
3151 CCTCTTCCAG ACAGACAGAC TGACAGTCTC CTACCCTCCG GCCATGTTCC 
3201 CTACCACAGA CCCATCCCAC AGCACCTGCC TTACAGAGGC CTGCATTCGA 
3251 GTGGCTGGAA AAATCCTGGA GTCCCTGGAC CGAGGGGTGA GCCCCTGTGA 
3301 GGACTTTTAC CAGTTCTCCT GTGNNNNNNN NNNNNNNNNN NNNNNNNNNN 
3351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
3401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
3451 NNNNNNNNNC TTAGCAAATA GGCAGTGTCC CATGAATGAG GAAGTGGATG 
3501 GTTCTGTGAA CACTCCCAGA GGGTGGGGAG GCAGAGAGCA GGGGACTATT 
3551 GAGAAGTGCA GATGGGTTTG ATGGGGGCAG AACTCTGGGT ACAATGGAGG 
3601 GCCGCTTCTC TGCACTCTGT TTGGAGCACT GTCGTGGTGT GGTAGACACC 
3651 AGGGAGCCTG TACTGCTTAG ATATCCTTGG GTCTCCATGG ACAGGGAGAG 
3701 GAAGCCACGG CTTGCTGTTT CAGACACTCT TCCTGGGTCT GCGTTAGCAG 
3751 GACTGCTCAT TGACAAGGCA AGGAGAGAAA CCGAGCAAGG GCCAGGGACT 
3801 CCCCCTCAGC AGTTAACGTA ATTGCCACCT GGATCCTGTG TTCTGCCCCA 
3851 CAGAAAACAC CACCTTCAAC TCCAGCAGTG AAGCTGAGCA GAAGACACAG 
3901 CGCTTCTACC TATCTTGCCT ACAGGTGGAG CGCATTGAGG AGCTGGGAGC 
3951 CCAGCCACTG AGAGACCTCA TTGAGAAGGT AGGGCCACTG AGCCGGTTGA 
4001 GGGCAGGGGA GCAGGAGAGG CCTTGAGAGA GGAGATGGCC CAGGAACGCT 
4051 TTGGGAGCTC CTGCACTAAT CATTCCACTT ATGGTCTCTA CATAGATTGG 
4101 TGGTTGGAAC ATTACGGGGC CCTGGGACCA GGACAACTTT ATGGAGGTGT 
4151 TGAAGGCAGT AGCAGGGACC TACAGGGCCA CCCCATTCTT CACCGTCTAC 
4201 ATCAGTGCCG ACTCTAAGAG TTCCAACAGC AATGTTATCC AGGTGATGAG 
4251 CTGGGAAAGG GTGGGGAGAG ACTTAGGGAC ACTTTGCTGA GCCCAGACTT 
4301 CCCTCTCCTG TGACAGGCAG GCTGGGCTGA CCCCCCGGCC CCACCCCTAC 
4351 CCCCGCTCGG GAATTCAGGT TCCCATGGTG GGGAAAGCGA GGGGCTCACC 
4401 TCCTTTCCTT GACATTGCAG GTGGACCAGT CTGGGCTCTT TCTGCCCTCT 
4451 CGGGATTACT ACTTAAACAG AACTGCCAAT GAGAAAGTAA GGAACATCTT 
4501 CCGAACCCCC ATCCCTACCC CTGGCTGAGC TGGGCTGATC CCTGTTGACT 
4551 TTTCCCTTTG CCAAGGGTCA GAGCAGGGAA GGTGAGCCTA TCCTGTCACC 
4601 TAGTGAACAA ACTGCCCCTC CTTTCTTTCT TCTTTTCTTC CTCCCTCCCT 
4651 CCCTTTCTTC CCCTTTTCCT TCCTTCCTTC CTCTTATTCT TCTAGTAGGT 
4701 TTCATAGACA CCTACTGTGT GCCAGGTCCA GTGGGGGAAT TCTGAGATAT 
4751 AAGTTTNCCG AGCCCATTGC CAGCAGGAGA GGGGATCCTT TAGAGTCGCA 
4801 CAAACAGGTC AGTCAAGTCT AAAGACNNNN NNNNNNNNNN NNNNNNNNNN 
4851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
4951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5101 NNNNNNNNNN NNNNNNNNNN GCCTGNACTT GCATGCACCG CGGTTCGGCT 
5151 NCTAGNAGNA TCCCCCCACT GCACTCCAGC CTGGGTGACG GAGAGAGACT 
5201 CCGACTCAAA AAAAAAAAAA AAAAAGAAAG AAAAAGAAAG AAGGAACAGT 
5251 TTAAACAAAA GTGTTGATGA GGCTGAGCAC AGTGGCTCAC ACCTGTAATC 
5301 CCCGCACTTT GGGAGGCTGA GGCCGGCGGA TCACTTGAGG TTAGGAGTTC 
5351 AAGACCAGGC TGGCCTACAA GGTGAAAACC CGTCTCTACT AAAAATACAA 
5401 AAATTAGCCA GGCATGGTGG TGTGCACCTG TAATCTCAGC TACTTGGGAG 
5451 GCTGAGGCAA AGAGAATCGC TTGAATCCAG GAGGCAGAGG TTGCAGTGAG 
5501 CTGAGATGGC ACCACTGCAC TCCAGCCTGG GCAACAGAAC AAGACTTCAT 
5551 CTCAAAAAAA AAAAAAAAAG TGTTGACGAG GGAAAGGCTA GGTGTGTCTG 
5601 GACCATGGCA AGGGGTCCAC TGTGGTAAAA TATAGAACTC AAGGCAGATG 
5651 AGAGGCTGGA GAGGTGGGCA GGAATGGGTT ATGGAGGGGA CCTTGAATAG 
5701 CACACTACGG AGTTTATTCT GTAGCTCCCG GAGAGCCATT GCATGCTCCA 
5751 AAGTAGGGAG GGAGCGCANT GCTTTGGGAA GTCAGTTTGT TTGGGGTGTG 
5801 AAGAGTANAT GTGAGAACNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
6951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
7951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

9951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

10951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

11401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNAG TTCCAGGCCC 

11451 ACCCTTGGGC CAAACATGTT GAAGACCGCC ATGCTGTAGC TAGAACTTAC 

11501 AAAAGATGTA AGCCTGGGCA TAGGTGGCCG GGTGCCGTTG TGGTCGCCAC 

11551 GCTATCTTGG GGAGGGATTA AGGGCAAGGA AAATTCACCT TGAGGCCCAA 

11601 GGAAGGCACA AGGGTTATCA CGTGAAGCCG AGGATCACCA TCACCATGCA 

11651 CTAACACGCC TTGGGCAAGC ACGAAGCGAG GAGTTGCCAT CTCAAAACAA 

11701 AAACGAAAAA CAAACAAACA AAATGCTAAT CAACTGTCAT TGGTAAGGCT 

11751 TCTGGTCAAC AGTATGCTGT CAATAGTTAA GTTTTTGGGC TGGGCGCAGT 

11801 GGCTCACGCC TGTAATCCCA GCACTTTGGG AGGCCAAAGC GGGTAGATCA 

11851 CCTGAGGTCA GGAGTCGAGA CTAGCCTGGC CAACATGGCG AAACCCAGTC 

11901 TCTACTAAAA ATATAAAAAT TAGCCAGGCG TGGTGGTGGG CACTTGTAAT 

11951 CCCAGCTACT CAGGAAGCTG AGGCAGAACT GCTTGAACTG GGAAGTGGAG 

12001 GTTGCAGTGA GCCGAGATCG TGCCATTGCA TTCCAGCCTG GGCGACAAGA 

12051 GCAAAACTCC ATCTCAAAAA AAAAAAAAAA AAAAAAAGTT GTTTTTGGGG 

12101 AGTCAAAAAT GAGGCCAGGC GCAGTGGCTC ATGCCTGTAA TCACAGCACT 

12151 TTGGGAGGCC GAGGCGGGTG GATCACCTGA GGTCAGGAGT TCGTGACCAG 
12201 CTTGGCCAAC CTGGTGAAAC CCCGTCTCTA CTAAAAATAC AAAAATTAGC 
12251 CGGGCATGGT GGCGGGCGCC CGTAATCTCA GCTACTTGGG CGGCTGAGGC 
12301 AGGAGAATTG CTTCAACCCG GGAGGCAGAG GTTGCAATGA GCTGAGATCG 
12351 CGCCACTGCA CTCCAGCCTT GGCGACAGAG GGAGACTCCA TGTCAAATTA 
12401 AAAAAAAGAC CCCAGGATTT TGGACTGTGC AGGGGTCGGT GCCCCAAACC 
12451 CCCACGTTGT TCAAGGTCAA CTGTACACTG TCATAGTCGG GAAAACTTCA 
12501 TCACTGCAGC TGCTCCTGTT TCTTGAAACC TGAAGCGGGA AACTGGATCC 
12551 TGGGACACTA CTGCCCCCTA TCGCCTGTTG GTCTTCAAAG AAATAATCCC 
12601 TTCAATTTTG CAAGGCCTGT GGTGTCATTC CCTTTTAACA GATAAGGAAA 
12651 CCGAGGCCAG GACGTGGTGG AAAATAATCA AGGTCACACA TCTATGTGCA 
12701 AAAGTGGAGT AACAACCCAG GCTCCTCATT CCCAGGTCAG TCCAGTGACC 
12751 TCAATTGACA TGAAATGTGT GAGGTCCTTC TGTGGCCCTG TGGCAGGGCC 
12801 TGAAGAGGAC AGCGTATGTA AATCAAGTCT TGTGCCTTCA TGAGTGAGGC 
12851 AGAGTAGAAA ATAACAGTAA TTCACTAGGA CCGAATCTGC ATTGTAAACA 
12901 GAGAGGAAAG GGCTAGTATT TGGCAGAAGG ATGTCAAGGA ACATTTTAGA 
12951 GATAAGAGGT GACATTTGGG TTCTGAGGGA TGAGTAGGAG TGTGCCAGGG 
13001 TGCAAAGGAT GAAAAGACAG CTCTAGCAGC TGGTAAGGGC TAAGGGGCAT 
13051 GGAGAAACAG CAAGACTTTG GGGAACTGGT AGAATTCTAA TTCTGGAAAA 
13101 TTTGAACAAG GTAATTTTTT GTGTGTGGTT AAGGTATTAC ATACATACAG 
13151 TAAAATAAAA TGCAATAGTT GCTGGGTGTG GAGGCTCACG CCTGTTAATC 
13201 CCAGTACTTT GGAAGGCAGA GGCGGGTGGA TCATCTGAAG GTCAGGAGTT 
13251 CGAGACCAGC CTGACCAACA TGGTGAAAAC CCGTCTCTAC TAAAAATACA 
13301 AAAATTACCT GGGTGTGGTG GCAGGCGCCC GTAATCCCAG CTACTTGGGA 
13351 GGCTAAGGGA GAAGAATAGC TTGAAACCCG GAGGTGGAGG TTGCAGTGAG 
13401 CTGAGATTGC ACTATTGCGC TCCAGCCTGG GTGACAAGAG TGAAAAGCTG 
13451 TCTCAAAATA AAATAAAAAT GTAATAGTCT AATTGATTTT TTTAAAAAAT 
13501 GTAGACATCC ACGTATCTAC CACCTAGGTA AAGATACTAG AGATTCCAGC 
13551 AACCTGGGAG GATCCCTCGT GCCCCTTTCA GGTCTATATG AGCCTCCACC 
13601 GTTCCCCAGT CCCCTGGAAG GAGAGGGGGT GGGAGAGGCA ACATGAAACC 
13651 TAAAAACCAG TGGGCTTCGC GCCTGTAATC CCAGCTATTG GGTTGGCTGA 
13701 GGCAGGAGGA TCACTTGCCC AGGAGTTGGA GGCTGCAGTG AGCTATGATC 
13751 GCGCCACCGC ACTCCAGCCT GGGCGACAGA TCAAGACCCC ATCTCTAAGC 
13801 AAACAAACAA ATAAACACCC CTCAAAACCC ATGGCTTCAG GCCTGGCGCG 
13851 GTAGCTTACT TCTGTAATCT CAGCACTTTG GGAGGCCGAG GAGGGCGGAT 
13901 CACTTGAGGT CAGGAGTTCC AGACCAGACT GGCCAACATG GCGAAACCCC 
13951 GTCTCTACTA AAAAATAAAA AAAAAAAAAA ATTGGCCGGG CGCGGTGGCT 
14001 CACACCTGTA ATTACCAGCA GNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14051 NNNNNNNNNN NNNNNNNNNN NTTTTAAAGA ATGTAGACAT CCACGTATCT 
14101 ACCACCTAGG TAGAGATACT AGAGATTCCA GCAACCTGGG AGGATCCCTC 
14151 GTGCGCCTTT CAGGTCTATA TGAGCCTCCA CCGTTCCCCA GTCCCCTGGA 
14201 AGGAGAGGGG GTGGGAGAGG CAACATGAAA CCTAAAAACC AGTGGGCTTC 
14251 GCGCCTGTAA TCCCAGCTAT TGGGTTGGCT GAGGCAGGAG GATCACTTGC 
14301 CCAGGAGTTG GAGGCTGCAG TGAGCTATGA TCGCGCCACC GCACTCCAGC 
14351 CTGGGCGACA GATCAAGACC CCATCTCTAA GCAAACAAAC AAATAAACAC 
14401 CCCTCAAAAC CCATGGCTTC AGGCCTGGCG CGGTAGCTTA CTTCTGTAAT 
14451 CTCAGCACTT TGGGAGGTCA AGGTGGGCGG ATCACTTGAA GTAAGGAGTT 
14501 CAAGTACCAT CCTGGCTAAC ACGGTGAAAC CCCGTCTCTA CTGAAAAGAC 
14551 AAAAAATTTA GCCGGGCGTG GTGGCGGGCG CCTTTAGTCT CAGCTACTCG 
14601 GGAGGCTGAG GCAGGAGAAT GGCGTGAACC CGGGAGGTGG AGCTTGCAGT 
14651 GAGCTGAGAT CGCACCACTG CACTCCAGTC TGGGTGACAG AGTGAGACTC 
14701 CATCTCAAAA AAAAAAAAAA AGAAGTCAAA GTAGTAGAAA CTGCTGATAG 
14751 ACTGAATGTG GGGGGTTAGG GAGATGGAGG AAGCTGAGTG ACTCCCAGGT 
14801 TTCTTGCATG GGGGACTGAC TGGATATAAA ATTAGTTGTG GGCCGGGCAC 
14851 GGTGGCTCAT GCCTTTAATC CCAGCACTTT GGGAGGCCAA AGCGGGCAGA 
14901 TCACTTGAGC TCAGGAGTTC AAGACCAGCC TGGGAAACAT GGTGAGACCC 
14951 CTTCTGTAAG GGNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
15001 NNNNNNNNNN NNNTGACCTT TTTTTGGCTC TGNTCGGTCA CTAGCANGCA 
15051 AGTTATTGGG AGTCTACAAG ATTCTTTCAC ACTATGCCCT CAAAATTGAC 
15101 TGTTCATGTA TGTGCAGACA TATAGAAAAA CAACGGGAGC CAGGCGCGGT 
15151 GGCTCACGCC GGTAATCCCA GCACTTTGGG AGGCCAAGGC GGGTGAATCA 
15201 TGGGGTCAGG AGTTCGAGAC CAGCCTGGCC AACATGGTGA AACCTGGTCT 
CTACTAAAAA 
CCCAGCTACT 
GGAGGTTGCA 
AGAGCAAAAC 
AAAACAACTG 
CAGTGGGATT 
TATATAACTA 
TTAAAGTATT 
ATAATAGCAG 
AAAAATAAAT 
ATATCTTTCA 
CATAATTCCA 
TGGGTAATCA 
CTGGTCTGTT 
TTTTCATAAT 
CGCTTGTAAT 
CAGGAGTTCA 
AAAATACAAA 
ACTTGGGAGG 
GCAGTGAGCC 
GACTCTGTCT 
TATAATTTGA 
AAAGCCAGCC 
GTCTCCAGGT 
CTGGGTGGAC 
CATTCAGTGC 
GGATGATCAG 
GTTTGGATGG 
TGGCTAGGGT 
GCTAGGAACC 
GATGCCATCT 
AGAGCTGGAT 
GAACTTCAGC 
TCAAGCACTG 
GGTAACCAGC 
CAGCAGTGAA 
CAAATTCTGT 
GCCTTGGTTT 
TAGGGTTGTT 
TAACATAAAT 
ACAGTTGAGG 
GCGCTATTAT 
CTTACCACAG 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 



TACAAAAAAT 
TGGGAGGCTG 
GTGAGCCGAG 
TCTGTCTCAA 
GATGTAAATT 
ATAAGAGTCC 
GAAGAAATAT 
ACCTGTAAGG 
AAAATGAGAA 
TGTGGTTAGG 
GCAAAAAAAG 
TATTTTATAT 
AAATGTTAAA 
CTTCAATTCT 
CAGATAAAAA 
CCCAGCACTT 
AGACCAACCT 
AATTACCCGG 
CTGAGGCAGG 
GAGATTGCGC 
CAAAAAAAAA 
TGAACTTATT 
ACAATCCCAG 
GCTCAGCCGT 
AGGGATGGGG 
AGGGGAAGGC 
CGAAATCCGG 
ATGAGAAGAC 
TGGGGCGCCA 
TGGGGAAGGA 
ATGATATGAT 
GATGTTTATG 
CCTGTAGAGG 
GGAAAGAGGT 
ATTGTTAAAA 
ATATAGTGCT 
TTCTACCTCT 
CATTTTCTGT 
GTGAGATAAA 
GGCCTTTAAA 
AAACTGAGGT 
CTTCTCCAAT 
AACAAANNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 



TAGCCGGGCG 
AGGCAGGAGA 
ATCGCGCCAG 
AAAAAAAAAA 
GATGAACAAA 
TTTAAAGTTG 
TTATTTATTA 
ATGTTTATGG 
CAGGTTAAAT 
ATGGGTTGTG 
TAGGTTACAA 
GCATATCAGG 
AGAACTTATC 
CAATTTTCCA 
TCATTGCACT 
TGGGAGGCTG 
GGCCAAGATG 
GCATGATGGC 
AGAATCGCTT 
CACTGCACTC 
AAAAAATCAT 
GTCAATTAAA 
TACCTTTACA 
ATTTCTTTAG 
CACCAAGGGT 
TGACTTTACC 
ACCGCATTTG 
CCGCCAGGCA 
TCTTGAGGTG 
AACAAACCCT 
TGGTTTCCCA 
ACGGGGTGAG 
GCACTGTTCC 
GCTTGTCGGT 
TGTTGGCTCT 
AACGAGCCAA 
AACTTTGTAA 
AAAATGGGGG 
TGAATACAGT 
TATTGCCAAT 
TTGGAGAGAT 
GGATTCTCTT 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 



TGGTGGCGGG 
ATCACTTGAA 
TGCACTCCAG 
AAAAAAAAGA 
TGAAGTAGTG 
TCTATGTGTT 
GGATATGATA 
TTTTTATGGC 
GTCCAACTAT 
AGGATCCTTA 
AAAATATATA 
GGAGGGAAAA 
TATGAATGAG 
AATTTTCTGT 
AGGCTGGGCG 
AGGCGGGTGG 
GTGAAACCCC 
GGGAGCCTGT 
GAACTCGGGA 
CATCCTAGGT 
TGCACTATAT 
ATGTGTACTT 
AATGGTGTTT 
TCTAGACGTT 
GGATGGGTGG 
TCCTCCCTCC 
AGGAGGCCCT 
GCCAAGGAGA 
GGGTTCAAGG 
TAACCTGGTC 
GACTTTATCC 
TACCTACGCT 
CTGGGCTTAG 
TTCTTTTAGA 
GTGACAGGCT 
GATTTGGAGT 
CCTTAACAAA 
TCCTACTAGT 
ATGTAAAAAA 
TATGGTTTAC 
ACTAATGAGT 
GCTCTCTGTC 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 



TGTCTGTAAT 
CCCAGGAGGC 
CCTGGGCGAC 
AAAGAAAAGA 
CTGCTTTGGG 
TATGTTTAAC 
ATGGATGTGC 
AATGTTGTTT 
AGGGTAAAGG 
AATGGCTGAT 
CCCTATACAA 
ACTCTAGAAG 
TGCTTTATAA 
GAATGTCCTC 
TGGTGGTTCA 
ATCACGTGGT 
AGCTCTACTA 
AATCCTAGCT 
GGCGGAGGTT 
AACACAGCCA 
TAAATTATAA 
AATTAAGAAA 
CCTTCTCATC 
CCCATTTCCC 
GGCAGGGATG 
CAGGCAGAGG 
GGGACAGCTG 
AAGTGAGCGG 
ATACAGTTTT 
TCTTCAGGCA 
TGGAGCCCAA 
CATCAGTACT 
AAATTGGGGC 
GGCAGATGGA 
GCAGGCCAAA 
CAAGCCTAAT 
ATCTCTCTAG 
GCCTTCCTCA 
ACAGCACCCA 
TAGATATTTT 
AGCCAAACTG 
TACTTCCCAA 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
NNNNNNNNNN 
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18301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18801 NNNNNNNNNN NNNNNNNNNN NNNNNAGAAT CACCAACAGC ATTGGATGAA 

18851 AATAAAGAAG AACAAGAGGT TCGTTTGAGA GGAAGCCGGG AAAATTCTCT 

18901 CGATAAAGAA ATGCAAGTGC GCGCGCGGCG CAACCACTAC AATAGTGTGT 

18951 CGTCCACCCC AGAGAGTGAA GGGGGCCCCC CCCGCCCCAA AGGAAAGGGG 

19001 TAGTGTCCAC GCCGCTCCAC AAAGAGAGAG AAGGAAAGAA GTAGTTTTCC 

19051 CCCCCCCGGG GAGAAACCTT GGATGGGGCT CANCCCCCCC TCTTTTTTTT 

19101 TCCCGCGAAA ACCCCCCCCA AAAAGTTTTT TTTAAAAAAC AAAAAAGGGG 

19151 GGTTTGGTTT TTTGGGCCCC GTGGCCCCTT TGGTTTAAAT TGGGAGAAAG 

19201 AGGGCTTAAA GGGGGGATTC AAGAAAAAAC CCCCCCCCAA TTGCCCCAAA 

19251 TTGTAATTTC CTAACCCCAA AAGGGGCCCC TAAAATTTCC GGGGAAACCC 

19301 GTGTGGGCAA TGGCCCATTA GTTTACCCAA TGCCTTTATT GACAAAGGTA 

19351 GGGCCCCATG GAGTCGTCCC CTCTAGCCTA GAATTCCCAG TGGCTCCTGC 

19401 AAGGGCCTTG GGACATTGAT GTAGCCCCAA GGGCCCTGAA GTCTGTGGAC 

19451 CAGGGCTGGT GGGGCACTGC TGCCCCCAAG AGACGAGCTC TGGTTTTGGT 

19501 GGGGTGCAAA GGTGAGTTCT CCTCAGGGCG CGAGTATGAC AAAGAAGGGA 

19551 ACTGCGGCCC TGGTGGCAGA ATGAGTCCCT GGCAGCCTTC CGGAACCACA 

19601 CGGCCTGCAT GGAGGAACAG TACAATCAAT ACCAGGTCAA TGGGGAGAGG (SEQ ID NO: 3) 
2113-2151
Intron: 2152-2439
Exon: 2440-2664
Intron: 2665-2850
Exon: 2851-2986
Intron: 2987-3209
Exon: 3210-3320
3321-3519
Exon:
3636
3978
Intron:
3979-
4242
Intron:
4420
4487-4576
4707
4811
5525
5591
Intron:
11595-11742
Intron: 11743-13150
Exon: 13151-13240
Intron: 13241-16408
Exon: 16409-16642
Intron: 16643-16747
Exon: 16748-16825
16826-18962
Exon: 18963-19222
Intron: 19223-19321
Exon: 19322-19347
Intron: 19348-19526
Exon: 19527-19647
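
The exon/intron coordinates can be used to look up which feature a given DNA position falls in. The following is a minimal sketch, not part of the original figure. It assumes the coordinates are 1-based and inclusive, and it transcribes only the rows whose label, start, and end are all legible in the listing; the names FEATURES, classify, and feature_length are hypothetical.

```python
from typing import Optional

# (start, end, label) rows copied from the coordinate listing.
# Assumption: coordinates are 1-based and inclusive.
# Only rows with a legible label, start, and end are included,
# so positions in the unreadable rows are not covered.
FEATURES = [
    (2152, 2439, "Intron"),
    (2440, 2664, "Exon"),
    (2665, 2850, "Intron"),
    (2851, 2986, "Exon"),
    (2987, 3209, "Intron"),
    (3210, 3320, "Exon"),
    (11743, 13150, "Intron"),
    (13151, 13240, "Exon"),
    (13241, 16408, "Intron"),
    (16409, 16642, "Exon"),
    (16643, 16747, "Intron"),
    (16748, 16825, "Exon"),
    (18963, 19222, "Exon"),
    (19223, 19321, "Intron"),
    (19322, 19347, "Exon"),
    (19348, 19526, "Intron"),
    (19527, 19647, "Exon"),
]


def classify(position: int) -> Optional[str]:
    """Return the label of the feature containing position, or None."""
    for start, end, label in FEATURES:
        if start <= position <= end:
            return label
    return None


def feature_length(start: int, end: int) -> int:
    """Length in bases of an inclusive 1-based interval."""
    return end - start + 1
```

Under these assumptions, classify(2707) and classify(15455) both return "Intron", which agrees with the Domain column of the SNP table. A position such as 4209 returns None here only because its row is not among the legible ones transcribed above.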



CHROMOSOME MAP POSITION: 

Chromosome # 3 



ALLELIC VARIANTS (SNPs):

DNA                               Protein
Position  Major  Minor  Domain    Position  Major  Minor
2707      A      G      Intron
4209      C      T      Exon      289       S
4355      -      T G    Intron
15455     -      A C    Intron


Context:

DNA Position

2707 TGACCTCGGTTGGCAACCCCGACTGTCTGGCAGATGGTGGAGTACAAACGGGCCACGCTT 
CGGGATGAAGACGCACCCGAGACCCCCGTAGAGGGCGGGGCCTCCCCGGACGCCATGGAG 
GTGGGCAAGGGGGCTTCCCCTTTCTCACCAGGCCCCAGCCCTGGCATGACGCCTGGCACA 
CCCAGGAGCTCTGGGCTGTTCTGGAGGGTCATCTGCCCCCACCTCCGCTCCATCTCTGGC 
CTCTGCTCTAGGACTATGGTGAGGCGATGCTAAGCCGTGACGTTGCACAAAACAGACTCA 
[A,G] 

GGCTCAACTCACTGGCTGGCCTCATTGCCCCCGGGCCCAGAGTTAACCCTGTGGCTCTGA 
AAACTGCCTGTGGCTTCACCCTCTGGTAATCTTGGATCCCTGCCCTGCATCTCAGTCACT 
CTCTGTCCCCCTGTGTTCCCCAGGTGGGATTCCAGAAGGGGACAAGACAGCTGTTAGGCT 
CACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGCACTGCTTC 
TGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGGTAGGTGGGCCCACACTCTTC 



4209 CCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGAGACCT 
CATTGAGAAGGTAGGGCCACTGAGCCGGTTGAGGGCAGGGGAGCAGGAGAGGCCTTGAGA 
GAGGAGATGGCCCAGGAACGCTTTGGGAGCTCCTGCACTAATCATTCCACTTATGGTCTC 
TACATAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGT 
GTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCACCGTCTACATCAGTGC 
[C,T] 

GACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGATGAGCTGGGAAAGGGTGGGGAGA 
GACTTAGGGACACTTTGCTGAGCCCAGACTTCCCTCTCCTGTGACAGGCAGGCTGGGCTG 
ACCCCCCGGCCCCACCCCTACCCCCGCTCGGGAATTCAGGTTCCCATGGTGGGGAAAGCG 
AGGGGCTCACCTCCTTTCCTTGACATTGCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTC 
TCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAACATCTTCCGAACCCC 



4355 GAGCTCCTGCACTAATCATTCCACTTATGGTCTCTACATAGATTGGTGGTTGGAACATTA 
CGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACA 
GGGCCACCCCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATG 
TTATCCAGGTGATGAGCTGGGAAAGGGTGGGGAGAGACTTAGGGACACTTTGCTGAGCCC 
AGACTTCCCTCTCCTGTGACAGGCAGGCTGGGCTGACCCCCCGGCCCCACCCCTACCCCC 
[-,T,G] 

CTCGGGAATTCAGGTTCCCATGGTGGGGAAAGCGAGGGGCTCACCTCCTTTCCTTGACAT 
TGCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTG 
CCAATGAGAAAGTAAGGAACATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGC 
TGATCCCTGTTGACTTTTCCCTTTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTG 
TCACCTAGTGAACAAACTGCCCCTCCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTT 



15455 CACGCCGGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGAATCATGGGGTCAGGAGTT 
CGAGACCAGCCTGGCCAACATGGTGAAACCTGGTCTCTACTAAAAATACAAAAAATTAGC 
CGGGCGTGGTGGCGGGTGTCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCA 
CTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCGAGATCGCGCCAGTGCACTCCAGCCTG 
GGCGACAGAGCAAAACTCTGTCTCAAAAAAAAAAAAAAAAAAAAGAAAAGAAAAGAAAAA 
[-,A,C] 

AACTGGATGTAAATTGATGAACAAATGAAGTAGTGCTGCTTTGGGCAGTGGGATTATAAG 
AGTCCTTTAAAGTTGTCTATGTGTTTATGTTTAACTATATAACTAGAAGAAATATTTATT 
TATTAGGATATGATAATGGATGTGCTTAAAGTATTACCTGTAAGGATGTTTATGGTTTTT 
ATGGCAATGTTGTTTATAATAGCAGAAAATGAGAACAGGTTAAATGTCCAACTATAGGGT 
AAAGGAAAAATAAATTGTGGTTAGGATGGGTTGTGAGGATCCTTAAATGGCTGATATATC 
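
Each context entry above consists of a 5' flank, a bracketed allele list such as [A,G] or [-,T,G], and a 3' flank. The following is a minimal sketch of how such an entry could be expanded into one full sequence per allele. It is not part of the original figure: the function names parse_alleles and expand_context are hypothetical, and it assumes that "-" denotes the absence of a base at the variant site.

```python
import re
from typing import List


def parse_alleles(token: str) -> List[str]:
    """Split a bracketed allele list like '[-,T,G]' into its alleles.

    Assumption: '-' means no base at the site, so it maps to ''.
    """
    match = re.fullmatch(r"\[([^\]]*)\]", token.strip())
    if match is None:
        raise ValueError(f"not an allele list: {token!r}")
    alleles = [a.strip() for a in match.group(1).split(",")]
    return ["" if a == "-" else a for a in alleles]


def expand_context(flank5: str, token: str, flank3: str) -> List[str]:
    """Return one sequence per allele: 5' flank + allele + 3' flank.

    Whitespace and line breaks inside the flanks are removed first,
    since the flanks are printed across several lines.
    """
    five = "".join(flank5.split())
    three = "".join(flank3.split())
    return [five + allele + three for allele in parse_alleles(token)]
```

For example, expand_context("AC GT", "[-,A,C]", "TT") yields the three sequences ACGTTT, ACGTATT, and ACGTCTT.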



FIGURE 31 



